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Abstract 

The paper describes the implementation and experimental validation of a new direct 
adaptive control scheme on a PUMA 560 industrial robot. The testbed facility consists of a 
Unimation PUMA 560 six-jointed robot and controller, and a DEC Micro VAX II computer 
which hosts the RCCL (Robot Control “C” Library) software. The control algorithm is 
implemented on the MicroVAX which acts as a digital controller for the PUMA robot, 
and the Unimation controller is effectively bypassed and used merely as an I/O device to 
interface the MicroVAX to the joint motors. The control algorithm for each robot joint 
consists of an auxiliary signal generated by a constant-gain PID controller, and an adaptive 
position-velocity (PD) feedback controller with adjustable gains. The adaptive independent 
joint controllers compensate for the inter-joint couplings and achieve accurate trajectory 
tracking without the need for the complex dynamic model and parameter values of the 
robot. Extensive experimental results on PUMA joint control are presented to confirm the 
feasibility of the proposed scheme, in spite of strong interactions between joint motions. 
Experimental results validate the capabilities of the proposed control scheme. The control 
scheme is extremely simple and computationally very fast for concurrent processing with 
high sampling rates. 

1. Introduction 

During the past decade, the control of robot manipulators has been the focus of consid- 
erable research, and many different control schemes have been suggested in the literature. 
With a few exceptions, all existing schemes are tested on manipulators through computer 
simulations only , often using the popular two-link arm paradigm. Although the simulations 
are useful for illustration and proof-of-concept, practical issues such as effect of friction, 
limitation on controller gains, and sampling rate constraint are often neglected. Despite the 
large number of proposed manipulator control schemes, the number of schemes that have ac- 
tually been experimentally evaluated on manipulators, and particularly on industrial robots, 
is very small today. 

This paper describes the implementation and experimental validation of a newly de- 
veloped direct adaptive control scheme [1,2] on a six-jointed PUMA 560 industrial robot. 
The control scheme has a decentralized structure and consists of a number of simple local 
feedback controllers. Each local controller consists of an auxiliary signal generated by a 
constant-gain PID controller, and an adaptive position-velocity (PD) feedback controller 


whose gains are updated on-line in real time. The control scheme is implemented on a Mi- 
cro VAX II computer using the RCCL software, and generates the control signals that drive 
the PUMA joint motors directly. This simple control scheme is not based on the complex 
PUMA dynamic model and is therefore implemented at a high sampling rate and yields a 
good tracking performance. 

The paper is structured as follows. In Section 2, an overview of the design theory 
for adaptive joint controllers is given. The descriptions of the testbed facility at JPL and 
the RCCL software are given in Section 3. Section 4 discusses the experimental results 
on simultaneous control of all six joint angles of the PUMA 560 industrial robot. The 
conclusions drawn from the paper are discussed in Section 5. 

2. Theory Overview 

In this section, the design theory for direct adaptive control of manipulators is outlined. 
The proposed control scheme has a decentralized structure, where each manipulator joint is 
controlled independently of the others by use of a local feedback controller. Therefore, the 
dynamics of each joint will be considered separately. 

Consider a robot manipulator with the n joint angles 9 — [9\ ,0 2 > • • • » 6 n ) and the cor- 
responding n joint torques T = [Ti, T 2 » • • • > T n ]. The dynamic model of the X th manipulator 
joint which relates 0i to Ti can be represented by the second-order nonlinear differential 
equation 

n 

mu{0)9i + ^2 m iA 6 )0j + (M) + SiW + hi(9i) = Ti (l) 

i* l 

where m tJ - is the {i,j) th element of the inertia matrix, and n t -, g*, hi are the i ih elements of 
the Coriolis/centrifugal vector, gravity vector, and friction vector, respectively. The terms 
in equation (l) are highly complicated nonlinear functions of the manipulator configuration 
9, the speed of motion 9, and the payload inertial parameters. The manipulator control 
problem is to generate the joint torques Ti(t), for i = 1 ,...,n, such that the joint angles 
0i(t) track some desired trajectories as closely as possible. 

In the proposed decentralized control scheme, the t th joint is controlled by the local 
adaptive feedback control law [2] 

Ti{t) = fi{t ) + k pi {t)ei{t ) + Mt)ei(t) (2) 

as shown in Figure 1, where e*(t) = 9 di (t) - 0i(t) and e t -(f) = 0 di (t) - are the position 
and velocity tracking-errors, and the controller terms are given by 

Weighted tracking-error 

fi(f) = Wpiei(t) + w vi ei(t ) (3) 

Auxiliary signal 

fi{t) = fi{ o) + Si f n{t)dt + P in{t) (4) 

Jo 
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Position feedback gain 


k P i{t) — ^pt(O) ■+■ 


«. r, 

Jo 


(t)ei(t)dt + f}{r i{t)ei{t) 


(5) 


Velocity feedback gain 


k vi {t) = + 7* f ri(t)ei(t)dt + A<r t -(t)e t -(t) (6) 

Jo 

In equations (3)-(6), 7»} are positive scalar integral adaptation gains, {pi, /?,, A,} 

are non-negative scalar proportional adaptation gains, and {wpi, are scalar positive 
weighting factors of the position and velocity tracking-errors. 

From the implementation viewpoint, the auxiliary signal fi(t) can be generated by a 
constant-gain PID feedback controller acting on the position tracking-error e,(i), since from 
equations (3) and (4), fi(t) can be expressed as 


fi(t) = fi{ 0) + pi [u?p<e t (0 + ttf w <ej(0] + Si f [w pi ei{t) -|- dt 

Jo 

= /*(°) + [pi w r>i + £<“>„,] ei(t) -I- [/>,•«;„,•] e f (t) + [<5,u> pt ] / ei(t)dt 

Jo 


(7) 


Hence, the joint torque (2) can be generated by the adaptive PID feedback controller 


Ti(t) = r,(o) + 




[kpi{t) + ha I dt + k vi (t) — ]ei(t) 


( 8 ) 


where kpi(t) = kpi{t) -(- piW pi + Siw v i, kn = SiW pi , and k vi (t) = k vi (t ) + are the 

adjustable PID gains and T t (0) = /t(0) is the initial joint torque. 


The dynamics of manipulator joints are highly coupled, as can be seen from equation 
(1). The local control law (2) for each joint compensates, to a large extent, for the static and 
dynamic cross-couplings that exist between the manipulator joints. For fast simultaneous 
motion of all joints, the inter-joint coupling effects can be significant and may cause insta- 
bility under the decentralized adaptive control scheme (2)-(6). In such cases, the controller 
adaptation laws are modified slightly in order to achieve robust stability in the presence of 
the unmodeled cross-coupling effects. A popular approach is the “<r-modification” method 
[3], which yields the adaptation laws 


fi{t) = fi{0) + Si f ri(t)dt + piTiit) - Oi f fi(t)dt 
Jo Jo 

kpi(t) = k pi ( 0) + o-i f ri(t)ei(t)dt + /?<r t -(f)e,-(t) - f kpi(t)dt 

Jo Jo 

k vi (t) = A:„i(0) + 7 i f ri(t)ei(t)dt + A<r<(0«i(0 ~ °\ f k vi (t)dt 

Jo Jo 


(9) 

( 10 ) 

( 11 ) 
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where <7 t - is a positive scalar design parameter. The a-modified adaptation laws produce a 
non-zero residual tracking-error of 0(y/a { ) but guarantee stability in the presence of inter- 
joint couplings. 

It is seen that the controller adaptation laws (9)-(ll) and the control action (2) are based 
entirely on the observed manipulator performance through 6(t) and 0^(0 rather than on the 
manipulator dynamic model (1). As a consequence, the knowledge of either the complex 
dynamic model and the parameter values of the manipulator or the inertial parameters of 
the payload is not required in the control law formulation. Thus, the adaptive controllers 
can cope with uncertainties or variations in the manipulator or the payload parameters. 
This is a highly desirable feature in practical applications, where some dynamic effects such 
as friction can not be modeled accurately and the payload mass can vary substantially. 
Furthermore, the proposed decentralized control scheme is extremely fast computationally, 
since the controller gains are generated on-line in real time by simple adaptation laws, 
and hence the control action can be evaluated very rapidly. Due to its simplicity and 
decentralized structure, the proposed scheme can be implemented on parallel processors 
for distributed concurrent computing with high sampling rates, yielding improved dynamic 
performance. 


3. Description of Testbed Facility 

In this section, we describe the robotic testbed facility at the Jet Propulsion Laboratory. 

The testbed facility at the JPL Robotics Research Laboratory consists of a six-jointed 
Unimation PUMA 560 robot and controller, and a DEC Micro VAX II computer, as shown 
in the functional diagram of Figure 2. The major components of the Unimation controller 
axe the LSI 11/73 microcomputer, six 6503 microprocessor boards (one per joint), and se- 
rial/parallel interface cards. The Micro VAX II hosts the RCCL (Robot Control “C” Library) 
software, which was developed by Hayward and Lloyd at Purdue and McGill Universities 
[4,5]. The original version runs on a DEC VAX 750 computer, and later the software is 
ported to a DEC Micro VAX II running UNIX 4.3 BSD operating system. The organization 
of the robot software is reflected in the control hierarchy diagram illustrated in Figure 2. 
The complete software resides on two different pieces of computing hardware. A Micro VAX 
computer, which plays the supervisory role, hosts a two-level software written in the “C” 
language. The lower level, called Robot Control Interface (RCI), provides the progr amm er 
a facility to write real-time control procedures. It serves as a substrate to the higher level 
(RCCL), which gets its collective name from the robot software. The higher level con- 
sists of routines to specify a robot trajectory in Cartesian coordinates. The second piece 
of hardware is the Unimation controller hosting an LSI 11/73 processor on which an I/O 
control program called “moper” executes to monitor co mmuni cation between the 6503 joint 
microprocessors and the RCI control level. 

At the lowest level of the hierarchy, robot servoing is achieved by the 6503 joint mi- 
croprocessors. The processor has two different operational modes: position and current. 
In position mode, the 6503 processor accepts a position setpoint from the LSI 11/73 and 
servos to the desired position by executing a control code written in the assembly language 
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that is stored in ROM (this servo code was developed by the Unimation Corporation). In 
this mode, the control task, triggered by a hardware clock, executes approximately at the 
sampling rate of 1 KHz. In current mode, each input is interpreted as current (or torque) 
by the 6503 processor and is simply converted into am analog value to be forwarded directly 
to the power drive electronics. This mode makes possible the implementation of any joint 
control law (e.g. force control and adaptive control) on a remote computer that can inter- 
face with the LSI 11/73. In order to implement the adaptive control algorithm, the joint 
PID servos provided by the resident Unimation code are in effect disconnected by selection 
of the 6503 current mode, and current inputs are supplied directly from the Micro VAX for 
driving the robot. 

At the intermediate level, the LSI 11/73 executes the “moper” communication monitor 
program that transfers data and commands back and forth between the LSI 11/73 and 
the Micro VAX n host computer. The interface between the two processors is a DRVll 
high-speed parallel link for control communication. The moper program synchronizes the 
operation of the entire system as described in the following. A hardware clock located in the 
Unimation controller constantly interrupts the LSI 11/73, and at each interrupt, the moper 
collects data relating to the robot state, such as joint positions, currents, and arm status, 
and transfers it to the MicroVAX via an interrupt. The Micro VAX receives this data and 
immediately sends position or current values that have been computed in the previous cycle 
by the control code to the LSI 11/73 for execution. Available system sampling time is in 
increments of 7 milliseconds, ranging from 7 to 56 milliseconds (18 — 143 Hz). The sampling 
time of 28 milliseconds is best suited for the 6503 position mode and is set as default. In 
our adaptive control implementation, the lowest sampling time of 7 milliseconds (143 Hz) 
is chosen to obtain the best control performance. 

On the MicroVAX, two programs run concurrently: the foreground planning level 
(RCCL) and the background control level (RCI). The planning level executes in the fore- 
ground in the sense that it interacts with the user and performs high-level computations as 
well as communicating with the control level. It has access to standard I/O resources such 
as files, devices, and system calls. At this level, the programmer can specify the task in 
Cartesian coordinate system in terms of homogeneous transformations and define a robot 
end-effector trajectory with a series of via points. Essentially the planning level consists 
of task sequencing, motion planning and queueing, and modeling of the world in terms of 
homogeneous transformations. 

RCI (Robot Control Interface) level executes a series of control routines during each 
sampling period to interface in real-time with the robot. In practice, the control level 
communicates with the planning level only through global external variables (i.e., shared 
memory). Both levels can communicate with the robot through predefined global variables: 
“how,” that contains information describing the state of the arm, and “chg,” that is used 
to control the arm. These variables are used for robot control and are usually accessed at 
the control level. To meet the constraints imposed by the sampling time in the range of 
milliseconds, the control level is executed in the UNIX kernel mode at the highest priority, 
which effectively locks out all hardware interrupts. Once initiated, it quickly loads the 
memory context of the robot control code, performs I/O with moper, and executes the 
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control code and supporting RCI interface routines. Since the control level is in kernel 
mode, it cannot access the usual system calls or I/O facilities. 

The RCCL software was originally designed to control the robot in the 6503 position 
mode (i.e., with the control action provided at the 6503 joint processor level at 1 KHz 
sampling rate). This way the user can concentrate mostly on the programming aspects of 
performing a task rather than the real-time control issues. For this purpose, by default, a 
set of standard real-time control routines is provided to perform functions such as trajectory 
generation, error checking, event synchronization, kinematic computations, and coordina- 
tion with the planning level. In order to implement a different control scheme, however, the 
user selects the 6503 current mode to drive the robot, which in turn necessitates writing 
control procedures essentially to replace the nominal control functions. In addition, the 
user must consider meeting his newly imposed sampling time requirement; more often he 
wants to lower the sampling time to control the robot more effectively. As a consequence, 
smaller and more compact control code must be written to meet the real-time constraint. 
For example, to implement our adaptive controller in the sampling rate of 7 milliseconds 
(lowest rate available in RCCL), the need for a trajectory generator is met by writing a 
simple cycloidal generator that provides smooth series of setpoints without added features 
such as real-time trajectory modification. In addition, the adaptive control algorithm for 
computing the desired motor currents from the observed joint positions is coded in its most 
numerically efficient form. 


4. Experimentation with a PUMA 560 Robot 

In this section, the theoretical results outlined in Section 2 are applied to a six-jointed 
PUMA 560 industrial robot in the testbed facility described in Section 3. 

To test and evaluate the control scheme of Section 2, the adaptive controllers are 
implemented on all six joints of the PUMA 560 robot. The dynamic model for a typical I th 
joint of PUMA can be written as 

6 

m «(0)^t + ^2 m »i% + n »(0> 0) + 9i{0) + hi(0i) — T{ (12) 

3 = 1 


where 0 = and the terms in equation (12) are highly complicated nonlinear 

functions of 0 and 0 as given in [6]. From equation (12), it is seen that the effective inertia 
mu(0), the gravity loading <7t(0), and the Coriolis/centrifugal torque n,(0,0) seen at each 
joint are nonlinear functions of all six joint angles; i.e., the robot configuration and speed. 
Furthermore, there are inertial couplings between joint motions, as indicated by m^9j 
terms, with the coupling factors dependent on the robot configuration. In the adaptive 
control implementation, the i th joint is controlled independently by the local feedback law 

Ti(t) = fi(t) -I- k pi (t)ei(t) + k vi (t)ei(t) (13) 
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where e«(i) = 6di{t) — (t) is the position tracking-error, 0<u(t) is the reference trajectory, and 

[fi,kpi,k v i] are the auxiliary signal, position and velocity feedback gains for the i th joint, 
respectively. It is seen that although the joint dynamics (12) are coupled, the proposed 
control scheme (13) is decentralized, i.e., the control torque does not depend on the joint 
angle 6j for j ^ t. 

In the experiment on adaptive control of PUMA, the sampling period is chosen as the 
smallest possible value T, = 7 milliseconds (i.e., sampling frequency /, = 143 Hz), since the 
on-line computations involved in the adaptive control law (13) are a few simple arithmetic 
operations. The adaptation gains in equations (3)-(6) are selected after a few trial-and-errors 
as * 

w p i = 15 , w v i = 10 , tw P 2 = 40 , w v 2 = 20 , ttf P 3 = 12 , w v3 = 4 

Wp 4 — 3 , U >„4 — 2 , Wp 5 — 3 , UJ „5 2 , Wp 6 ^ 5 ^t )6 2 (14) 

All joints : 6 = 30, a = 100, 7 = 800, p = /3 = X = a = 0 

The initial values of all controller gains are chosen as zero, i.e. k p i{ 0) = /c ut (0) = 0 for 
1 = 1, ... ,6. The initial values of the auxiliary signals are chosen as 

/ 2 ( 0) = 12sgn[0 d2 (7-) - ^ 2 ( 0 )] + 1.02 sin ^ 2 ( 0 ) 

- 8.4 sin[0 2 (O) + 0 3 (0)] - 37.2 cos 0 2 (O) Nt. meter 

/ 3 (0) = 2sgn[^(r) - 0 3 (O)] + 0.25 cos[0 2 (O) + <? 3 (0)] (15) 

- 8.4 sin(0 2 (O) + 0 3 (O)] Nt.meter 

/i(0) =/ 4 (0) = /s(0) = / 6 (0) =0 

In the above expressions, the first term is chosen empirically to overcome the large stiction 
(static friction) present in the joints, and the remaining terms are used to compensate for the 
initial gravity loading [6]. It is important to note that friction and gravity compensations 
are not used separately in addition to the adaptive controllers, and are used merely as the 
initial conditions of the auxiliary signals in order to improve the initial responses of the joint 
angles. Furthermore, no information about the PUMA dynamic model or parameter values 
is used for implementation of the control scheme. 

In the experiment, the PUMA arm is initially at the “zero” position 0(0) = [0, 0, 0, 0, 0, 0] 
with the upper arm horizontal and the forearm vertical, forming configuration. All the 
six joint angles are then commanded to change simultaneously from the zero positions to the 
goal positions 0<i(3) = [60°, —60°, 60°, 60°, —60°, — 60°] in three seconds, and the desired 
trajectories 0<n{t) are synthesized by a cycloidal trajectory generator software in RCCL. 
While the robot is in motion, the readings of the joint encoders at each sampling instant are 
recorded directly from the robot, converted into degrees and stored in a data file. Figures 
3(i)-(vi) show the desired and actual trajectories of all six PUMA joint angles. It is seen 
that each joint angle 0«(t) tracks the desired trajectory 6di{t) very closely despite inter-joint 
couplings. The experimental results demonstrate that adaptive independent joint control of 
the PUMA robot is feasible, in spite of the static and dynamic couplings between the joints. 

* The unit of angle in the control program is “radian,” and hence the numerical values 
of the adaptation gains are large. 
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5. Conclusions 

A decentralized direct adaptive control scheme has been experimentally validated on 
a PUMA 560 robot, where each robot joint is controlled independently by a simple local 
feedback controller at a high sampling rate. This avoids the computational burden of a 
centralized controller, which results in a slower sampling rate and hence degrades the robot 
performance. The experimental results demonstrate that accurate trajectory tracking is 
achieved by a simple control algorithm, without any knowledge of the complex PUMA 
dynamic model. 

Adaptive control is particularly useful in applications (such as in space) where the 
manipulator has long light-weight arms and handles payloads of unknown and heavy weights. 
In such cases, the dynamics of the manipulator is dominated by the inertial terms due to 
the payload. The controller adaptation can then compensate for such terms and provide a 
stable and consistent performance under gross payload variations. 

Finally, it is important to note that the rate of sampling, /», has a central role in the 
performance of any digital control system. In general, the value of f t is dictated largely by 
the amount of on-line computations that need to be performed during each sampling period 
in order to calculate the required control action. The simplicity of the control scheme 
proposed in this paper allows joint servo loops to be implemented with a high s am pling 
rate, yielding improved tracking performance. 
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Figure 3 (iii). Response of Elbow Angle 83 under Adaptive Controller 
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Figure 3 (iv). Response of Wrist Angle 64 under Adaptive Controller 
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